Off-Line Arabic Handwritten Word Segmentation Using Rotational Invariant Segments Features
نویسندگان
چکیده
This paper describes a new segmentation algorithm for handwritten Arabic characters using Rotational Invariant Segments Features (RISF). The algorithm evaluates a large set of curved segments or strokes through the image of the input Arabic word or subword using a dynamic feature extraction technique then nominates a small “optimal” subset of cuts for segmentation. All the directions of stroke are converted to two main segments: '+' and w'-' RISF. A list of nominated segmentation points are prepared from the '+' segments and evaluated according to special conditions to locate the final segmentation points. The RISF algorithm was tested by using our new designed database AHD/AUST and the IFN/ENIT database. It has achieved a high segmentation rate of 95.66% on AHD/AUST and 90.58% on IFN/ENIT handwritten Arabic databases.
منابع مشابه
Radial Line Fourier Descriptor for Segmentation-free Handwritten Word Spotting
Automatic recognition of historical handwritten manuscripts is a daunting task due to paper degradation over time. Recognition-free retrieval or word spotting is popularly used for information retrieval and digitization of the historical handwritten documents. However, the performance of word spotting algorithms depends heavily on feature detection and representation methods. Although there exi...
متن کاملOff-line Arabic Handwritten Recognition Using a Novel Hybrid HMM-DNN Model
In order to facilitate the entry of data into the computer and its digitalization, automatic recognition of printed texts and manuscripts is one of the considerable aid to many applications. Research on automatic document recognition started decades ago with the recognition of isolated digits and letters, and today, due to advancements in machine learning methods, efforts are being made to iden...
متن کاملComponent-based Segmentation of Words from Handwritten Arabic Text
Efficient preprocessing is very essential for automatic recognition of handwritten documents. In this paper, techniques on segmenting words in handwritten Arabic text are presented. Firstly, connected components (ccs) are extracted, and distances among different components are analyzed. The statistical distribution of this distance is then obtained to determine an optimal threshold for words se...
متن کاملWord Spotting in Handwritten Arabic Documents Using Bag-Of-Descriptors
This paper presents a query-by-example word spotting in handwritten Arabic documents, based on Scale Invariant Feature Transform (SIFT), without using any text word or line segmentation approach, because any errors affect to the subsequent word representation. First the interest points are automatically extracted from the images using SIFT detector, then, we use SIFT descriptor to represent eac...
متن کاملOverview of Some Algorithms of Off-Line Arabic Handwriting Segmentation
We present in this paper an overview of realized works in the field of automatic segmentation of off-line Arabic handwriting. The Arabic writing is cursive in nature even printed or handwritten. The shapes of characters vary considerably according to their positions within the word. The word shapes change depending on whether letters are horizontally or vertically ligatured, i.e. superposed let...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
- Int. Arab J. Inf. Technol.
دوره 5 شماره
صفحات -
تاریخ انتشار 2008